Performance of Scalabale Shared-Memory Architectures

نویسندگان

Bahman S. Motlagh

Ronald F. DeMara

چکیده

Analytical models were developed and simulations of memory latency were performed for Uniform Memory Access (UMA), Non-Uniform Memory Access (NUMA), Local-RemoteGlobal (LRG), and RCR architectures for hit rates from 0.1 to 0.9 in steps of 0.1, memory access times of 10 to 100 ns, proportions of read/write access from 0.01 to 0.1, and block sizes of 8 to 64 words. The RCR architecture provides favorable performance over UMA and NUMA architectures for all ranges of application and system parameters. RCR outperforms LRG architectures when the hit rates of the processor cache exceed 80% and replicated memory exceed 25%. Thus, inclusion of a small replicated memory at each processor significantly reduces expected access time since all replicated memory hits become independent of global traffic. For configurations of up to 32 processors, results show that latency is further reduced by distinguishing burst-mode transfers between isolated memory accesses and those which are incrementally outside the working set.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Evaluation of Or-Parallel Logic Programming Systems on Distributed Shared-Memory Architectures

Distributed shared-memory (DSM) architectures have been object of research by many computer science groups. In this work, we investigate how DSM architectures aaect performance of or-parallel logic programming systems and how this performance approaches that of conventional C systems. Our work concentrates on basic performance, scal-ability, and programmability.

متن کامل

Next Generation Internet High-Speed Switches and Routers

Shared memory architecture for packet switches was normally thought to be unsuitable for building high performance switches/routers. The main reason lies in their perceived poor scalability. In particular, shared memory architectures are typically used to build output-queued switches which are regarded as the best candidate to achieve optimal delay-throughput performance. The current trend in r...

متن کامل

An Analysis of Three Transaction Processing Architectures

In this paper, we investigate the issues involved in using multiprocessors for high performance transaction processing applications. We use a simulation model to compare the performance of three different architectures, namely, Shared Everything, Shared Nothing and Shared Disks. In Shared Everything, any processor can access any disk and all memory is shared. In Shared Nothing, neither disks no...

متن کامل

Design of a novel congestion-aware communication mechanism for wireless NoC architecture in multicore systems

Hybrid Wireless Network-on-Chip (WNoC) architecture is emerged as a scalable communication structure to mitigate the deficits of traditional NOC architecture for the future Multi-core systems. The hybrid WNoC architecture provides energy efficient, high data rate and flexible communications for NoC architectures. In these architectures, each wireless router is shared by a set of processing core...

متن کامل

Eecient Barriers for Distributed Shared Memory Computers University of Colorado at Boulder Eecient Barriers for Distributed Shared Memory Computers

Barrier algorithms are central to the performance of numerous algorithms on scalable, high-performance architectures. Numerous barrier algorithms have been suggested and studied for Non-Uniform Memory Access (NUMA) architectures, but less work has been done for Cache Only Memory Access (COMA) or attraction memory 2] architectures such as the KSR-1. In this paper, we present two new barrier algo...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Journal of Circuits, Systems, and Computers

دوره 10 شماره

صفحات -

تاریخ انتشار 2000

Performance of Scalabale Shared-Memory Architectures

نویسندگان

چکیده

منابع مشابه

Performance Evaluation of Or-Parallel Logic Programming Systems on Distributed Shared-Memory Architectures

Next Generation Internet High-Speed Switches and Routers

An Analysis of Three Transaction Processing Architectures

Design of a novel congestion-aware communication mechanism for wireless NoC architecture in multicore systems

Eecient Barriers for Distributed Shared Memory Computers University of Colorado at Boulder Eecient Barriers for Distributed Shared Memory Computers

عنوان ژورنال:

اشتراک گذاری